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THERMOSTABLE POLYMERASE DNA SEQUENCING 
REACTION CONCENTRATE 
Field of the Invention 

This invention relates to DNA sequencing, a 
5 reaction concentrate for sequencing, a primer for the 
sequencing and a kit comprising the reaction 
concentrate. 

BACKGROUND OF THE INVENTION 
There are several method for sequencing 

10 DNA . One of the most frequently used is that of 
Sanger et al reported in Froc . Natl. Acad. Sci. USA 
volume 74, No. 12, pp. 5463-5467. Sanger's method is 
based on the inhibitory activity of nucleotide 
analogues such as dideoxy nucleotide triphosphates 

15 (ddNTPs where N is thymine, guanine, cytosine or 

adenine) on the ability of DNA polymerase to catalyze 
the growth of a oligonucleotide chain. 

For example, when a primer and template are 
incubated with DNA polymerase in the presence of a 

20 mixture of ddTTP and dTTP, as well as the other three 
deoxyr ibonucleos ides triphosphates (dNTP wherein N is 
G, A and C) one of which is labeled with a radio- 
active nucleotide, a mixture of fragments will be 
synthesized all having the same 5' end with ddT 

25 residues at the 3' end is obtained. When this 
mixture is fractionated by electrophoresis on 
denaturing acrylamide gels, the pattern of bands show 
the distribution of dTs in the newly synthesized 
DNA. By using analogous terminators for other 

30 nucleotides in separate incubations and analyzing the 
samples in parallel on a gel, a pattern of bands is 
obtained from which the sequence can be deduced. 

In general, a DNA fragment to be sequenced 
is spliced into a particular site in the DNA of a 

35 sequencing vector, such as M13 phage, using standard 
recombinant DNA splicing methods . The resulting 



WO 90/08839 



PCT/US90/00440 



recombinant phage DNA is used to infect bacterial 
cells, and virus particles are produced. These 
particles contain only one of two strands of DNA. 

The single strands of DNA are purified from 
5 the viruses. They are mixed with pieces of DNA 

referred to as primers. Primers are complementary to 
a region of the M13 DNA near the position where the 
DNA to be sequenced has been inserted. The primer 
forms base pairs with the M13 DNA creating a short 

10 double-stranded region of DNA that serves as a primer 
for synthesis. A large number of these partially 
double— stranded DNA molecules are mixed with DNA 
polymerase and a radioactive dNTP. DNA polymerase 
synthesizes radioactive DNA from the M13 template 

15 (beginning at the 3' end of the short double-stranded 
region). The synthesis ultimately involves the DNA 
region that is to be sequenced and which acts as a 
template . 

The synthesis can be terminated by including 

20 a ddNTP in the biochemical reaction. If the ddNTP is 
ddTTP, the new radioactive chain will stop after it 
would normally have added a T. Normal dTTP is 
included in the reaction mix to allow some DNA 
synthesis before a ddTTP stops the synthesis. Since 

25 there are many T's in a nucleotide sequence the stops 
will not always be in the same place. Thus by 
carefully adjusting the amount of analog in the 
reaction it is possible to create a collection of 
radioactive DNA fragments that begin at the 3' end of 

30 the primer and stop at the various positions where 

T's occur. This procedure is repeated in three other 
test tubes comprising all of the necessary dNTPs and 
an analog ddGTP, ddCTP or ddATP. This results in 
four separate collections of DNA fragments. Every 

35 fragment in each tube starts at the same place and 
ends at different distances in a G, A, T or C 
dependent on the specific ddNTP included. 
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The lengths of the molecules are measured by 
gel electrophoresis under conditions that are so 
precise that they can separate DNA sequences 
differing by only one nucleotide. The four separate 
5 collections are electrophoresed next to each other 
and after electrophoresis a photographic film is 
exposed to the radioactive emissions of the DNA in 
the prepared electrophor etic gel. By analysis of the 
developed film one is able to read the DNA sequence 
10 directly from the photographic film. 

This method normally involves many different 
steps that must be carefully carried out in order to 
achieve precise and accurate sequence analyses. The 
elimination of any of the steps would improve the 
15 method. For example, the method as generally 
performed requires separate mixing of all the - 
chemical components involved at the time of use. 
That is the buffer, enzyme » nucleotides, nucleotide 
analogues and oligonucleot ide primer are mixed 
20 together separately for each nucleotide G, T, A and C. 
SUMMARY OF THE INVENTION 
The present invention provides a nucleotide 
sequencing reaction concentrate comprising: 

a) sufficient thermostable polymerase to 
25 provide from 100 to 500 I.U./mL; 

b) from 0.3 to 30 mM of a reducing agent 
selected from dithiothreitol and B-mercaptoethanol ; 

c) sufficient phosphate buffer to maintain a pH 
of about 7.5; 

30 d) at least 40% glycerol; 

e) from 15 to 150 uM dGTP or daGTP (where "a 
G" represents 7-deazaguanine ) ; 

f) from 15 to 150 uM dATP ; 

g) from 15 to 150 uM dTTP; 

35 h) from 10 to 18 dCTP; and 

i) one member selected from the group 
consisting of: 
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i) sufficient ddGTP to provide a ddGTP to 
dGTP or daGTP ratio of about 5; 

ii) sufficient ddATP to form a ddATP to 
dATP ratio of about 33; 

5 iii) ddTTP in a sufficient amount to form a 

ddTTP to dTTP ratio of about 28; and 

iv) ddCTP in a sufficient amount to form a 
ddCTP to dCTP ratio of about 22. 

Use of the reaction concentrate of this 
10 invention eliminates the need to mix all the 

components needed for sequencing a nucleotide chain 
at the time of use. The reaction concentrate 
comprises all of the essential components in a 
premixed package that can be stored and reused during 
15 several cycles. One advantage of the reaction 
concentrates is. the simplicity of the method 
involving execution of the sequencing reaction 
steps. Such steps involve the fewest number of steps 
of all current sequencing procedures and approaches a 
20 desirable "single— step" protocol for sequencing. 

In a preferred embodiment the reaction 
concentrate comprises, in addition to the components 
listed above: 

a) from 175 to 700ug/mL of concentrate of a 

25 protein selected from the group consisting of gelatin 
and bovine serum albumin; 

b) at least 25 mM of potassium glutamate; 

c) from 0.1% to 0.35% V/V of a non-ionic 
surfactant; and 

30 d) from 9 to 35 mM of a magnesium salt such as 

magnesium acetate or magnesium chloride. 

In this embodiment the reaction concentrate 
can be stored at temperatures as low as -80°C to 
-20°C without any deleterious effect on the ability 

35 of the components to react in the chemical reactions 
involved in nucleotide, sequencing. 
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In practice, the dCTP concentration in the 

reaction can be reduced to 3yM and still support 

sufficient enzyme reaction kinetics. This is the 

only one of the four nucleotides that can be reduced 

5 to that level. Consequently, dCTP is the recommended 

target nucleotide used for radioactive labeling of 

the synthesized strand. Such a low concentration of 

the target nucleotide is essential for efficient 

35 

labeling, especially if the S thioester of dCTP 
10 is to be used. Other radioactive nucleotides can be 

used with substantially reduced efficiency. 

DET AILS OF THE INVENTION 
The essential components of the reaction 

concentrates are 100 to 500, preferably 250 I.U./mL 
15 of a thermostable polymerase, preferably Thermus 

aquaticus polymerase (Taq pel); dithiothreitol (DTT) ; 

phosphate buffer; magnesium salt; glycerol; the 

deoxynucleotide triphosphates dNTPs wherein N is 

guanine (G) , 7-deazaguanine (aG), adenine (A), 
20 thymine (T) and cytosine (C); and a ddNTP (dideoxy 

NTP wherein N is as previously defined) to terminate 

the G, A, T and C reactions. 

In one embodiment this invention provides 

four identical reaction concentrates differing only 
25 in the concentration of each dNTP, the specific ddNTP 

and the ratio of ddNTP to dNTP . The specific ddNTP 

would depend on which nucleotide is designated as the 

terminating nucleotide in any polymeric sequence. 

According to the Sanger method that would be G, A, T 
30 or C. 

In addition to the foregoing essential 
elements, reaction concentrate can optimally contain 
a) a protein; b) a non-ionic detergent; and c) 
potassium glutaraate. 
35 The reaction concentrate can be made even 

more convenient to the ultimate user by including 
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therein a primer as well as the radioactive labeled 
dNTP. However, the short half life of radioactive 
dNTPs makes the inclusion thereof impractical. 

Reaction concentrates can be constructed 
5 that contain nearly all of the essential components 
to DNA sequencing. This presents the opportunity for 
a user to simply add a solution of DNA to be 
sequenced to a mixture of reaction concentrate, 
incubate the mixture at elevated temperature, 

10 terminate and analyze the reaction products as in 
contemporary practice. Such method eliminates many 
steps and is far more amenable to automation than any 
contemporary sequencing methodology. 

The thermostable enzyme is present in the 

15 concentrate in sufficient amount to provide an 
activity of about 0.5 I.U per base-specific 
reaction. I.U. is the amount of enzyme required to 
catalyze a polymerization of 10 micromoles of 
substrate nucleotide per 30 minutes under a broad pH 

20 range and a temperature of 70 Q C for this enzyme. 
As used herein, the term "thermostable 
enzyme" refers to an enzyme which is stable to heat 
and is heat resistant and catalyzes polymerization of 
nucleotides to form a nucleotide sequence that is 

25 complementary to the unknown nucleotide sequence. 
Generally, the synthesis will be initiated at the 3' 
end of each primer and will proceed in the 5' to 3' 
direction along the template strand thereby pro- 
ceeding into the unknown nucleotide sequence of the 

30 sequencing vector, until synthesis is terminated by a 
ddNTP, producing oligonucleotides of different 
lengths. There may be a thermostable enzyme, 
however, which initiates synthesis at the 5» end and 
proceeds in the other direction, using the same 

35 process as described above. 
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The preferred thermostable enzyme herein is 

a DNA polymerase isolated from Thermus aguaticus (Taq 

pol). Various strains thereof are available from the 

American Type Culture Collection, Rockville, 

5 Maryland, and is described by T.D. Brock, J, Bact. 

(1969) 18:289-297, and by T. Oshima, Arch. Microbiol. 

(1978) 117-: 189-196. One of these preferred strains 

is strain YT-1 . 

Our experiments have shown that Taq Pol 

10 1. does not react at low temperatures and can 

be stored in elevated salt and appropriate reaction 

buffer conditions at -20°C; 

2. is relatively free of contaminating 

nucleases and phosphatases; and 

15 3. maintains appreciably activity in relatively 

high concentrations of glycerol. These observations 

made the present reaction concentrates possible. 

The thermostable enzyme may also be produced 

by recombinant DNA techniques, as the gene encoding 

20 this enzyme has been cloned from Thermus aguaticu s 

genomic DNA. The complete coding sequence for the 

Thermus aguaticus (Taq) polymerase can be derived 

from bacteriophage CH35:Taq#4-2 on an approximately 

718 

3.5 kilobase (kb) Bglll-Asp partial restriction 
25 fragment contained within an -18 kb genomic DNA 
insert fragment. This bacteriophage was deposited 
with the American Type Culture Collection (ATCC) on 
May 29, 1987 and has accession no. 40,336. 
Alternatively, the gene can be constructed by 
30 ligating an -750 base pair (bp) Bglll-Hinlll 

restriction fragment isolated from plasmid pFC83 
(ATCC 67,422, deposited May 29, 1987) to an -2.8 kb 
Hin III-Asp718 restriction fragment isolated from 
plasmid pFC85 (ATCC 67,421, deposited May 29, 1987). 
35 The pFC83 restriction fragment comprises the 

amino-terminus of the Taq polymerase gene while the 
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restriction fragment from pFC85 comprises the 
carboxyl-terminus , Thus, ligation of these two 
fragments into a correspondingly digested vector with 
appropriate control sequences will result in the 
5 translation of a full-length Taq polymerase. 

These parameters were determined 
experimentally. Less than the 0.5 I.U. per base- 
specific reaction results in artificial termination 
of nucleotide chains. Greater than 2 I.U. per 

10 base-specific reaction is unnecessary. 

The protein can be bovine serum albumin 
(BSA) or gelatin. Taq pol enzymatic activity is 
improved by a protein environment. The protein also 
enhances the shelf life of the reaction concentrate. 

15 The presence of the protein also seems to avoid the 
inclusion of artifacts in a polymerized nucleotide 
sequence in which Taq pol catalyzed polymerizations. 
Amounts of 175-700ug/mL of protein in the reaction 
concentrate, preferably 350ug/mL, are desirable. 

20 Concentrations below 175ug/mL result in 

instability. Concentrations greater than 700pg/mL 
are unnecessary. 

A reducing agent is essential. 
Dithiothreitol (DTT) is preferred. It maintains the 

25 sulfhydryl groups of Taq pol in a reduced state. 
This reduced state is necessary for Taq pol 
activity. Other reducing agents include 
[3-mercaptoethanol . Amounts from 0.3 to 10 mM of 
DTT are useful, preferably 3.5 mM. Concentrations 

30 outside of this range result in diminished rates of 
polymerization. Concentrations above this range are 
unnecessary. 

Commercial preparation of Taq pol usually 
contains small amounts (about 0.1 mM) of EDTA as an 

35 aid for preservative purposes. However, more than 
ImM EDTA would be detrimental to the reaction 
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concentrate because EDTA is a metal scavenger and 
would therefore have an affect on magnesium ion 
activity which is essential for Taq pol activity. 

The reaction concentrate preferably contains 
5 up to 17.5 mM of a magnesium salt such as magnesium 
chloride or magnesium acetate (±507«). Magnesium 
acetate is preferred. It is essential that it is 
present at about a 5 mM concentration in the final 
reaction mix. Amounts in the ranges of ±507« of 

10 5 mM magnesium acetate will be effective. Above or 
below these ranges the magnesium will have a 
deleterious affect on the enzyme activity. 

Potassium glutamate (K glutamate) also 
enhances Taq pol enzymatic activity. It is essential 

15 for long term storage of the reaction concentrate. 
Long term storage requires a minimum amount of 25 mM 
of the potassium glutamate. Higher concentrations 
lead to an unacceptable salt concentration that would 
have a deleterious affect on the electrophoretic 

20 analysis carried out in the sequencing method 
provided by this invention. 

Potassium hydrogen phosphate and potassium 
dihydrogen phosphate (l^HPO^ and KH 2 P0 4 ) 
should be present in an amount sufficient to 

25 establish a pH of about 7.5. Phosphate buffer is a 
thermally stable buffer. It is able to maintain 
essentially the same pH over a temperature range of 
-20° to 75°C. It is necessary to maintain the pH 
constant in storage as well as in use so that Taq pol 

30 activity is constant from storage to use. 

Glycerol is an antifreeze. Repeated 
freezing of any enzyme must be prevented to maintain 
enzyme activity as the reaction concentrate goes 
through the expected cycles of thawing for use and 

35 then restorage for future uses. We have found that 
at storage temperatures of -70 8 C to -20°C at least 
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40% glycerol must be added to the reaction concen- 
trate to prevent repeated freezing. Up to 80% of 
glycerol could be used without adversely affecting 
Taq pol activity. 
5 The reaction concentrate can comprise from 

0.1 to 0.4% of a non-ionic detergent. Examples 
include ethoxylated fatty alcohol ethers and lauryl 
ethers, ethoxylated alkyl phenols, octylphenoxy 
polyethoxy ethanol compounds, modified oxyethylated 

10 and/or oxypropylated straight-chain alcohols, 

polyethylene glycol monooleate compounds, polysorbate 
compounds, and phenolic fatty alcohol ethers. More 
particularly preferred are TRITON X-100, Tween 20, 
from ICI Americas Inc., Wilmington, DE, which is a 

15 polyoxyethylated (20) sorbitan monolaurate, and 

TM 

Iconol NP-40, from BASF Wyandotte Corp., 
Parsippany, N J , which is an ethoxylated alkyl 
phenol (nonyl) . 

Each of the four dNTPs are also essential 

20 components of each concentrate. Each of the four 
concentrates must contain one, and only one, of the 
ddNTPs according to the termination reaction designed 
into each reaction concentrate package. 

In one embodiment the materials of the 

25 present invention can be provided to the ultimate 
user as a kit in several embodiments. Consistent 
with the spirit of this invention of user conven- 
ience, it is desirable to provide a kit in which the 
kit components can be used directly without dilution 

30 steps. Thus the most convenient kit would be 

designed to allow the user, based on the practice in 
many biochemistry labs , to take approximately 2uL 
of the reaction concentrate for direct use. 

In one embodiment the kit can comprise 

35 distinct and separately packaged chemicals which may 
be labeled 1) primer and 2) separate reaction 
concentrates G, A, T and C. 
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An optional user variable package can 
include a sequencing vector or a control DNA and a 
radioactive labeled dNTP. The control has a known DNA 
sequence which allows the user to confirm that the 
5 kit is operative. 

The sequencing vector comprises the 
nucleotide chain to be sequenced. The most widely 
used sequencing vectors have been derived from 
bacteriophage M13 . This vector is widely available 

10 commerically in various formats usually differing 
only by their multiple cloning regions. Widely used 
versions of this vector include Ml3mpl8 and mpl9. 
Other commercially available sequencing vector 
templates .include pUC , pIBI , Bluescript and others. 

15 If the primer described hereinafter is used, any 
sequencing vector having a lacZ nucleotide sequence 
can be used. 

Alternatively, the user may choose to supply 
the primer and not use the supplied primer. The user 

20 may also choose to independently label the primer 
with a radioactive or fluorescent label. In this 
situation, additional use of radioactive labels is 
unnecessary. As long as the primer concentration and 
binding ability is substantially unmodified, the 

25 reaction concentrations of this invention will be 
useful . 

Commercially supplied radioactive nucleo- 
tides are sold at a concentration range of about 3 to 
16 uM in water. This amount (about luL) is added 

30 as a user variable where that amount is then further 
diluted by adding a certain amount of the user 
variable part of the composition to each of 4 
different reaction concentrates. That is, the 1 \ih 
of radioactivity as a user variable is co-mixed with 

35 the user's DNA or control DNA t water and 5 uL of 

primer to total 20 to 23 yiL, and is divided 4 times ■ 



WO 90/08839 



PCT/US90/00440 



to end up with 2.5 uCi per 5 uL to which 2 \xL 

of reaction concentrate are added, giving four final 

reaction mixtures of 7 uL each. 

The required amount of radionucleotide for 
5 each of the 4 reactions is based on completing the 
sequence analysis in 24 hours with use of a preferred 

isotope. The preferred isotope is alpha-labeled 

35 32 

S dCTP although others such as P could be 

used. The preferred range would therefore be l—2\iL 

10 of alpha-labeled 35 S dCTP in the kit. This 

provides about lOuCi/uL at a specific activity of 

400-600 Ci per mM of 35 S dCTP. 

The user can supply the radioactive dNTP or 

it can be included in each of the reaction 

15 concentrates. When the user provides both the 

sequencing vector and the radioactive dNTP, this 

constitutes the user variables. 

One version of a user variable is presented 

below. These variables are supplied by the user. 

20 Optionally they may be supplied as part of a kit. . 

Concentration in 

Final Reaction 

User Variable Concentrate Mixture 

ljigM13mpl8 or a user selected 

25 sequencing vector amounting to 

0.4 pmoles of the 

DNA to be sequenced 0.1 pmoles /7uL 

IpL alpha-labeled dCTP 2.5uCi/7uL 

Plus water to a final 
30 volume not to exceed 18 uL 

The primer concentrate package comprises an 
appropriate concentration of an oligonucleotide 
primer in a volume of 5uL. 
35 Essential to Taq pol sequencing is the 

extension of an appropriate primer. Elevated 
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temperature of synthesis does not permit effective 
use of "universal primer A' , which is a 17 base 
oligomer described herein below. Using longer 
oligomers does permit "spontaneous" priming of 
5 template DNA at elevated temperatures, and using such 
an appropriate primer obviates the need for the 
priming reaction essential to other DNA sequencing 
methodologies. The phrase "spontaneous" means that 
the priming reaction occurs without intervention by 
10 the user. 

In a kit according to this invention the 
primer includes the commonly used universal primer 
region of M13mpl8, but also includes additional bases 
in the surrounding sequence, taking advantage of more 

15 stable G and C residues. This oligomer is compatible 
with all lacZ based vectors (M13 , pUC , pIBI , 
Bluescript and others). The oligomer sequence is: 
5' CCCAGTCCACGACGTTGTMAACGA£GGCCAGTG 3» 

The underlined region is the traditionally 

20 used universal primer A region. If primers shorter 
than 33 mer are used, excessive molar amounts are 
required for spontaneous priming. Such excessive 
amounts will result in non-specific priming leading 
to ambiguous sequencing results. 

25 Longer primers can also be used. However, 

they seriously compromise sequencing results. 
Extending in the 5' direction leads to G rich 
sequence inclusions, resulting in ambiguous electro- 
phoretic mobilities caused by nucleotide structure. 

30 Extending in the 3' direction interferes with the 

cloning sites of the M13 vector. A 3 molar excess of 
the above primer to target DNA gives full spontaneous 
priming. Greater than a 10 molar excess produces 
background on sequencing gels, interpreted as 

35 non-specific priming. Limiting the total amount of 
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primer to the specified 0.3 moles per 7^L reaction 
limits the total number of possible primed target DNA 
molecules. Hence, the user can use excessive amounts 
of DNA without depleting the nucleotide pools (up to 
0.3 pmoles of primer at 200 number-average bases per 
extension is 15 pmoles of dCTP consumption for an 
average unknown DNA sequence, whereas a 7uL 
reaction containing 3uM dCTP, amounts to 21 pmoles 
available for consumption), 

A preferred primer concentrate contains: 

Concentration in 
Final Reaction 

Mixtnrp 

1 mW Tris/HCl (pH8) 0 .18 mM 

15 0.1 mM EDTA 0 . 018 m 

0.24 vtM A-33 primer 0 .3 pmoles/7uL 

Total Volume 5 uL 

The third separately labeled group of 
20 packages are the reaction concentrates. The reaction 
concentrate may be provided in the kit as' four 
separate labeled packages, i.e. concentrate G , 
concentrate A, concentrate T and concentrate C. Each 
of the reaction concentrates will contain the same 
25 chemicals at the same concentrations except the 
concentrations of dNTPs and ddNTPs in each will be 
different. The ratio of ddNTP to the dNTP have been 
adjusted experimentally for each concentrate to 
achieve balanced lane distribution of the radioactive 
.30 labeled dNTP; Balanced lane distribution leads to 
uniform appearance of photographs showing the 
oligonucleotide fragment distribution. This results 
in more precision and accuracy in reading sequence 
from the photographic film. 
35 A very useful reaction concentrate is 

presented below. 
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Concent rat ion of 
Final Reaction 

Mixture 

0.25p/pL Tag pol 0.5u/7uL 

5 350|ig/ml BSA 100 ug/ml 

3.5 mM DTT 1 mM 

17.5 mM Mg Acetate 5 mM 

25 mM K Glutamate 7.1 mM 

13.1 mM K 2 HP0 4 3.75 mM 

10 4.4 mM KH 2 P0 4 1.25 mM 

0.35% Triton X-100 0 . 1% 

40% Glycerol 11.4% 

and a dNTP/ddNTP concentrate selected from Table I 
below. 
15 Total Volume 2 uL 

Table I 

d NTP Concentrations CyiM) for Reaction Concentrates 

20 G A T C . 

daGTP 22.75 70 70 70 

dATP 105 17.5 105 105 

dTTP 52.5 52.5 17.5 52.5 

dCTP 10.5 10.5 10.5 10.5 

25 ddGTP 113.75 - 

ddATP - 577.5 

ddTTP " - - 490 

ddCTP - - - 231 

30 The final reaction concentrations dNTP and 

ddNTP are shown in Table II based on the concentra- 
tions of the various chemical components in each 
package of the above kit. The nucleotides daGTP and 
dGTP are interchangeable. 



35 



WO 90/08839 



PCT/US90/00440 



-16- 
Table II 

Final Reaction Concentrations in T iM 

G A T C 

5 daGTP 6.5 20 20 20 

dATP 30 5 30 30 

dTTP •• 15 15 5 15 

d.CTP 3 3 3 3 

ddGTP 32.5 

10 ddATP 165 

ddTTP . 140 

ddCTP - 66 

dd/d ratio 5 33 28 22 

15 

In another embodiment of the kit the 
reaction concentrates G, A, T and C can also include 
the primer and/or the radioactive labeled dNTP. 

In another embodiment of the kit, a single 

20 reaction concentrate may include all chemicals except 
the dNTPs and/or the selected ddNTP. The dNTPs and 
selected ddNTP could be separately packaged according 
to Table I. Each of these separately packaged dNTP 
mixtures would be added to the reaction concentrate, 

25 as appropriate, at the time of use. 

In manufacturing the reaction concentrate, 
the order of addition is relatively unimportant, 
except the enzyme should be added last. Generally 
the order of addition is: salts and buffers, 

30 nucleotides,' other reagents, glycerol and enzyme with 
thorough mixing upon each addition, carried out at 
ice temperature. 

Using the kit described above, in any of the 
described formats, the method of sequencing would 

35 generally comprise the steps of: 
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a) providing a sequencing vector comprising an 
unknown nucleotide sequence adjacent to a 3 1 end of a 
known nucleotide chain in the vector; 

b) mixing the vector with a primer and a 

5 radioactive labeled dNTP; wherein the primer has a 
nucleotide sequence which is complementary to a 
portion of the known vector DNA sequence that is 
adjacent to the unknown DNA sequence; 

c) dividing the mixture b) into four equal 

10 volumes in four different vessels labeled G, A, T and 
C; 

d) providing a reaction mixture by; 

i) adding to the vessel designated G a 
reaction concentrate according to the present 

15 invention comprising ddGTP; 

ii) adding to the vessel designated A a 
reaction concentrate according to the present 
invention comprising ddATP; 

iii) adding to the vessel designated T a 
20 reaction concentrate according to the present 

invention comprising ddTTP; and 

iv) adding to the vessel designated C a 
reaction concentrate according to the present 
invention comprising ddCTP; 

25 e) incubating the four vessels G, A, T and C at 

a temperature of 70-74°C from 15 to 30 minutes; 

f) terminating the reactions by the addition of 
3 to 5 nL of a reaction stop mixture to each of the 
four tubes; 

30 g) separating electrophoret ically the oligo- 

nucleotides formed in e) according to length; and 

h) exposing the electrophoretic separations to 
a photographic film which is sensitive to the 
radiation generated by the radioactive dNTP to 

35 establish the nucleotide sequence of the unknown 
nucleotide chain. 
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In a typical reaction, a user provides a 
quality single-stranded DNA preparation, typically 
lug (0.4 pmoles) of M13mpl8 DNA, at a concentration 
of >100ug/ml. The user combines the DNA, 5uL 
5 of the Primer/Buffer, water and radioactive nucleo- 
tide (usually X\xCi of 32 P or lOuCi of 35 S, 
alpha-labeled dCTP at >600 Ci/mM), to a total 
volume of 21uL. This DNA mixture is divided 
equally (5uL) among four tubes (preferably 0.5ml 

10 microfuge tubes), designated for the essential G— , 
A-, T- and C-specific reactions. These tubes are 
placed at 70-74°C. Then 2jjL of the appropriate 
specific Reaction Concentrate is mixed into the 
designated tubes. The tubes are capped and the 

15 incubation continued at the 70-74°C for 30 minutes. 

The reaction can be terminated with the 
addition of 3uL of reaction stop mixture. Reaction 
stop mixtures are well known. EPO 0 258 017 describe 
such mixtures as including EDTA to chelate magnesium 

20 ion, phenol, SDS or CHCl^ . A useful mixture in 
formamide solvent is 
0.02 M EDTA; 

0.07 M Tris (to neutralize EDTA); 
0.05% w/v brophenol blue; and 
25 0.02% w/v xylene cyanol. 

The latter two components are anionic 
chromophores to provide a visual assessment of the 
progress of electrophoresis. 

While the main advantage of using Taq pol 
30 for sequencing resides is its ability to synthesize 
at high temperature (minimizing structural interfer- 
ence by the template strand), a considerable benefit 
realized by this invention is its adaptability to 
double— stranded DNA (dsDNA) sequencing including 
35 supercoil sequencing. 
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The principle of double— stranded or 
supercoiled sequencing depends upon a fraction of 
supercoils being irreversibly denatured to a 
single-stranded state, thereby being subject to 
5 primed synthesis essential to sequencing reactions. 

A significant fraction of irreversibly 
denatured supercoils is obtained by alkaline 
denaturation and acid neutralization of supercoiled 
DNA. Normal double— stranded DNA may be similarly 
10 treated, resulting is higher fraction of single- 
stranded DNA. 

In practice, alkaline denaturation and 
subsequent acid neutralization of supercoil or 
double-stranded DNA will result in an accumulated 
15 salt concentration that is high and a pH that is 

relatively undefined because of the relatively small, 
inaccurate volumes necessitated by this process. It 
is generally necessary to repurify the treated DNA to 
eliminate the salt and control the pH, since all 
20 other extant sequencing protocols will not tolerate 
elevated salts and/or a buffered pH that would 
overwhelm the buffer system (usually Tris) utilized 
in those methods and necessary to the activity of 
those enzymes. 

25 Phosphate buffer has the highest degree of 

thermal stability of any commonly used buffer. The 
phosphate buffer was chosen for the reaction 
concentrate since the pH must be controlled for the 
benefit of long-term storage at temperatures as low 

30 as -80°C, and reaction temperatures as high as 80°C. 
Conversely, Tris buffer has among the lowest thermal 
stability of any buffer, the pK of which lowers more 
than two full units from 0°C to 70°C. Hence, the 
supercoiled sequencing reaction can be carried out 

35 with the method of this invention without having to 
repurify the denatured DNA. 
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Th is is accomplished by: 

a) providing a double-stranded sequencing 
vector having a lacZ nucleotide sequence into which 
dsDNA having an unknown nucleotide sequence has been 

5 inserted adjacent to a 3 1 end of a known nucleotide 
chain in the vector ; 

b) denaturing the vector with an alkaline 
solution to form ssDNA; 

c) neutralizing the denatured DNA containing 
10 solution; and 

d) carrying out steps b) through h) of the 
nucleotide sequencing method described hereinbefore. 

In a typical example of the above sequencing 
method, a several uL volume of supercoiled DNA (in 

15 a dilute buffer, e.g. ImM Tris, O.lmM EDTA, pH 8) is 
denatured with luL of 0.1M KOH at 37 *C for 2 
minutes. More severe treatment is usually not 
necessary. On ice, the primer and the radioactive 
dNTP are added, and the denatured DNA is neutralized 

20 with the addition of IpL of 0 . 2M TrisCl. Water is 
then added to obtain a 21 to 23 uL volume. The 
final salt and pH will generally be inaccurate. 
However, the method of this invention can be used 
without further adjustments. In theory, the 

25 resulting salt system consists of a 30-50mM Tris 

buffer at a pH of 7-9. Upon heating this solution to 
70°C for enzymatic reaction, the influence of the ' 
Tris buffer becomes negligible, since the phosphate 
buffer contributed by the reaction concentrate 

30 stabilizes the final pH of the entire mixture. 

Normal single-stranded sequencing occurs unperturbed, 
and supercoil sequencing occurs at the expected 
reduced efficiency, commensurate with the fraction of 
irreversibly denatured supercoils. Hence, supercoil 

35 sequencing is accomplished without repurif ication of 
the denatured supercoiled DNA, using essentially the 
same methodology as single strand sequencing; 
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The invention has been described in detail 
with particular reference to preferred embodiments 
thereof, but it will be understood that variations 
and modifications can be. effected within the spirit 
5 and scope of the invention. 
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We claim: 

1. A nucleotide sequencing reaction 
concentrate comprising: 

a) sufficient thermostable polymerase to 
5 provide from 100 to 500 I.U./mL; 

b) from 0.3 to 30 mM of a reducing agent 
selected from dithiothreitol and ft-mercaptoethanol ; 

c) sufficient phosphate buffer to maintain a pH 
of about 7.5; 

10 d> at least 40% glycerol; 

e) from 15 to 70 uM dGTP or daGTP; 

f) from 15 to 150 uM dATP; 

g) from 15 to 150 uM dTTP; 

h) from 10 to 18 uM dCTP; and 

15 i) one member selected from the group 

consisting of: 

i) sufficient ddGTP to provide a ddGTP to 
dGTP or daGTP ratio of about 5; 

ii) sufficient ddATP to form a ddATP to 
20 dATP ratio of about 33; 

iii) ddTTP in a sufficient amount to form a 
ddTTP to dTTP ratio of about 28; and 

iv) ddCTP in a sufficient amount to form a 
ddCTP to dCTP ratio of about 22. 

25 2. The concentrate of claim 1 wherein the 

thermostable polymerase is Thermus aquaticus 

polymerase . 

3. The reaction concentrate of claim 2 

also comprising: 
30 a) from 175 to 700ug/mL of concentrate of a 

protein selected from the group consisting of gelatin 

and bovine serum albumin; 

b) at least 25 mM of potassium glutamate; and 

c) from 9 to 35 mM of a magnesium salt. 

35 4. The reaction concentrate of claim 2 

also comprising: 
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a) from 0.1 to 0.47, amount of a non— ionic 
surfactant; 

b) from 0.1 to 10 pCi/yL of an 
alpha-labeled radioactive dNTP; 

5 c) from 9 to 35 mM of a magnesium salt; and 

d) from 0.1 to 1.0 uM of an oligonucleotide 
primer having the nucleotide sequence: 

5' CCCAGTCACGACGTTGTAAAACGACGGCCAGTG 3' 

5. A nucleotide sequencing reaction 
10 concentrate comprising about: 

a) 0.25 I.U./jiL of Thermus aquat icus 
polymerase ; 

b) 350 micro g/mL of bovine serum albumin; 

c) 3.5 mM dithiothreitol ; 

15 d) 17.5 mM magnesium acetate; 

e) 25 mM of potassium glutamate; 

f) 13.1 mM of K 2 HP0 4 ; 

g) 4.4 mM of KH 2 P0 4 ; 

h) 0.35% of Triton X 100 surfactant; 

20 i) 407. glycerol; and one mixture selected from 

the group consisting of: 

A) about 22.7 viM dGTP or daGTP; 
105 uM dATP; 

52.5 uM of dTTP; 
25 10.5 \iU of dCTP; and 

113.7 uM of ddGTP; 

B) about 70 uM of dGTP or daGTP; 
17.5 uM dATP; 

52.5 uM dTTP; 
30 10.5 yM of dCTP; and 

577 uM of ddATP; 
. C) about 70 \iH of dGTP or daGTP; 

105 uM of dATP; 

17.5 uM of dTTP; 
35 10.5 uM of dCTP; and 

490 yM of ddTTP; 
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D) about 70 of dGTP or daGTP; 
105 uM of dATP; 
52.5 uM of dTXP; 
10.5 uM of dCTP; and 
5 231 uM of ddCTP. 

6. The reaction concentrate of claim 5 
also comprising 0.15 \xH of an oligonucleotide 
primer having the sequence: 

5' CCCAGTCACGACGTTGTAAAACGACGGCCAGTG 3' 
10 7. The reaction concentrate of claim 6 

also comprising 1 to 10 \iCi of alpha-labeled 
radioactive dCTP per 2 \xL of reaction concentrate. 

8. The reaction concentrate of claim 7 

wherein the radioactive labeled nucleotide is labeled 
32 35 
15 with the element P or J ^S. 

9. A method of oligonucleotide sequencing 
comprising the steps of : 

a) providing a sequencing vector comprising an 
unknown nucleotide sequence adjacent to a 3 ' end of a 

20 known nucleotide chain in the vector; 

b) mixing the vector with a primer and a 
radioactive labeled dNTP; wherein the primer has a 
nucleotide sequence which is complementary to a 
portion of the known vector DNA sequence that is 

25 adjacent to the unknown DNA sequence; 

c) dividing the mixture b) into four equal 
volumes in four different vessels labeled G, A, T and 
C; 

d) providing a series of reaction mixtures by; 
30 i) adding to the vessel designated G a 

reaction concentrate according to claim 1 or 2 
comprising ddGTP; 

ii) adding to the vessel designated A a~ 
reaction concentrate according to claim 1 or 2 
35 comprising ddATP; 
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iii) adding to the vessel designated T a 
reaction concentrate according to claim 1 or 2 
comprising ddTTP; and 

iv) adding to the vessel designated C a 
5 reaction concentrate according to claim 1 or 2 

comprising ddCTP; 

e) " incubating the four vessels G, A, T and C at 
a temperature of 70-74°C from 15 to 30 minutes; 

f) terminating the reactions by the addition of 
10 3 to 5 uL of a reaction stop mixture to each of the 

four tubes ; 

g) separating electrophoretically the oligo- 
nucleotides formed in e) according to length; and 

h) exposing each gel to a photographic film 
15 that is sensitive to the radiation generated by the 

radioactive dNTP to establish the nucleotide sequence 
of the unknown nucleotide chain. 

10. A method according to claim 9 wherein 
each reaction concentrate G, T ( A and C already 

20 includes a radioactive dNTP thereby obviating the 
need^to include said radioactive dNTP in step b) of 
claim 9 . 

11. The method of claim 9 wherein each 
reaction concen- trate, G, T, A and C, also includes 

25 the primer thereby obviating the need to include said 
primer in step b) of claim 9. 

12. The method of claim 9 wherein the 
sequencing vector has a lacZ nucleotide sequence. 

13. The method of claim 9 wherein the 
30 sequencing vector is selected from the group 

consisting of M13mpl8 DNA, pUC , pIBI and Bluescript. 

14. The method of claim 9 wherein the 
primer has the nucleotide sequence: 

5' CCCAGTCACGACGTTGTAAAACGACGGCCAGTG 3' 
35 15. The method of claim 9 or 11 in which 

the sequencing vector is M13mpl8 and the reaction 
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concentrates have compositions according to claim 2, 
3, A or 5. 

16. A nucleotide sequencing kit comprising: 

a) a control sequencing vector having a lacZ 
5 nucleotide sequence and luL of alpha-labeled 

radioactive dCTP in a first container; 

b) an oligonucleotide primer having the 
sequence: 

5' CCCAGTCACGACGTTGTAAAACGACGGCCAGTG 3' 
10 in a second container; and 

c) a reaction concentrate according to claim 2\ . 
3 or 5 in a second set of containers . 

17. A nucleotide sequencing kit comprising 
an oligonucleotide primer having the sequence: 

15 5 1 CCCAGTCACGACGTTGTAAAACGACGGCCAGTG 3' 

in a first container and a reaction concentrate 
according to claim 2, 3 or 5 in a second set of 
containers . 

18. A nucleotide sequencing kit comprising 
20 a reaction concentrate according to claim 6, 7 or 8. 

19. An oligonucleotide having the 
nucleotide sequence: 

5' CCCAGTCCACGACGTTGTAAAACGACGGCCAGTG 3' 

20. A method of sequencing double-stranded 
25 (dsDNA), comprising the steps of: 

a) providing a double-stranded sequencing 
vector having a lacZ nucleotide sequence into which 
dsDNA having an unknown nucleotide sequence has been 
inserted adjacent to a 3' end of a known nucleotide 

30 chain in the vector; 

b) denaturing the vector with an alkaline 
solution to form ssDNA; 

c) neutralizing the denatured DNA containing 
solution; 

35 d) mixing the vector with a primer, a magnesium 

salt and a radioactive labeled dNTP; wherein the 
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primer has a nucleotide sequence which is complemen- 
tary to at least a portion of Known vector DNA 
sequence that is adjacent to the unknown DNA sequence; 

e) dividing the mixture d) into four equal 

5 volumes in four different vessels labeled G, A, T and 
C; and 

f ) providing a series of reaction mixtures by: 

i) adding to the vessel designated G a 
reaction concentrate according to claim 1 or 2 

X0 comprising ddGTP; 

ii) adding to the vessel designated A a 
reaction concentrate according to claim 1 or 2 
comprising ddATP; 

iii) adding to the vessel designated T a 
15 reaction concentrate according to claim 1 or 2 

comprising ddTTP; 

iv) adding to the vessel designated C a 
reaction concentrate according to claim 1 or 2 
comprising ddCTP; 

20 g) incubating the four vessels G, A, T and C at 

a temperature of 70-74°C from 15 to 30 minutes; 

h) terminating the reactions by the addition of 
3 to 5 uL of a reaction stop mixture to each of the 
four tubes; 

25 i) separating electrophoretically the formed 

oligonucleotides according to length; and 

j) exposing each gel to a photographic film 
which is sensitive to the radiation generated by the 
radioactive dNTP to establish the sequence of the 

30 supercoiled DNA. 
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